FIG. 8 

-GAM PRECURSOR DNA 



GAM TARGET GENE 




FIG. 9 

^102 



EXPRESSED 
RNA DATA 



SEQUENCED 
DNA DATA 



PROTEIN 
FUNCnONDATA 



-110 



BIOINFORMATIC 
OLIGONUCLEOTIDE 
DETECTION ENGINE 
TRAINING & 
VALIDATION 
FUNCTIONALLITY 



BIOINFORMATIC OLIGONUCLEOTIDE 


DETECTION ENGINE 

^112 




NON-CODING GENOMIC SEQUENCE 






DETECTOR 








HAIRPIN 






DETECTOR 




i s-116 




DICER-CUT LOCATION 






DETECTOR 








TARGET GENE BINDING-SITE 






DETECTOR 




i ^120 




FUNCTION & UTILITY 






ANALYZER 




1 



BIOINFORMATICALLY DETECTED 
GROUP OF NOVEL 
OLIGONUCLEOTIDES 



FIG. 10 

^110 

BIOINFORMATIC 
OLIGONUCLEOTIDE DETECTION ENGINE 
TRAINING & VALIDATION FUNCTIONALLITY 1 00 

. cl? 2 

TRAIN ENGINE TO RECOGNIZE KNOWN 



HAIRPIN DETECTOR 
TRAINING & VALIDATION 
FUNCTIONALLITY 



^126 

DICER-CUT LOCATION DETECTOR 
TRAINING & VALIDATION FUNCTIONALITY 



^128 

TARGET GENE BINDING-SITE DETECTOR 
TRAINING & VALIDATION FUNCTIONALITY 





r ^130 


DETECT NOVEL OLIGONUCLEOTIDES 







VALIDATE EXPRESSION & FUNCTION 
OF SAMPLE OLIGONUCLEOTIDES BY WET- 
LAB 



FIG. 11A 



EXPRESSED 
RNADATA 



SEQUENCED 
DNA DATA 



NON-PROTEIN CODING 
GENOMIC SEQUENCE 
DETECTOR 



NON-PROTEIN CODING 
GENOMIC SEQUENCES 



J 



FIG. 11B 



GET RNA SEQUENCES 



SELECT 
NON-PROTErN CODING 
SEQUENCES 



LOCATE ON DNA 
(BLAST) 

, I 



1 



EXPAND TO 
TRANSCRIPTION START & 
END SIGNALS 



GET DNA SEQUENCES 






PARSE INTO 
NON-PROTErN CODING 
SEQUENCES 







LOCATE TRANSCRIPTION 
START & END SIGNALS 



DEFINITE EXPRESSED NON- 
PROTEIN CODING 
GENOMIC SEQUENCES 



PROBABLE EXPRESSED 

NON-PROTEIN CODING 
GENOMIC SEQUENCES 



FIG. 12A 



^Z 



124 



HAIRPIN DETECTOR TRAINING 
& VALIDATION 
FUNCTIONALITY 



NON-PROTEIN CODING 
GENOMIC SEQUENCES 





r ^114 


HAIRPIN DETECTOR 



± Cl. 

HAIRPIN STRUCTURES ON 
GENOMIC SEQUENCES 



FIG. 12B 



CALCULATING FOLDING 
PATTERNS & CORRESPONDING 
FREE-ENERGY 

I 

DETERMINE HAIRPIN STRUCTURES 

I zz 

DETERMINE EXTENT OF 
SIMILARITY TO KNOWN HAIRPIN 
STRUCTURES 

GROUP HAIRPIN STRUCTURES 
INTO HAIRPIN STRUCTURE 
'FAMILIES' 

SELECT BEST 
HAIRPIN STRUCTURE 'FAMILIES' 

I 

SELECT BEST HAIRPIN STRUCTURE 
FROM EACH HAIRPIN STRUCTURE 
'FAMILY' 



FIG. 13A 



JL 



126 



^ 

HAIRPIN STRUCTURES 
ON GENOMIC 
SEQUENCES 



DICER-CUT LOCATION 
DETECTOR TRAINING & 
VALIDATION FUNCTIONALITY 



J 





r r \\6 


DICER-CUT LOCATION 
DETECTOR 




i r 140 



fDICER-CUT SEQUENCE; 
FROM HAIRPIN 
STRUCTURES 



FTG. 13B 



GET DICER-CUT LOCATION OF KNOWN 
MICRO-RNA OLIGONUCLEOTIDES 



REPRESENT 3' END 
(LOCATION, SEQUENCE, 
'BULGES') 



REPRESENT 5' END 
(LOCATION, SEQUENCE, 
'BULGES') 



TRAIN 
5' END RECOGNITION 
CLASSIFIER 



TRAIN 
3' END RECOGNITION 
CLASSIFIER 



DETECT DICER-CUT LOCATION 
OF SAMPLE NOVEL 
OLIGONUCLEOTIDE 



VALIDATE BY WET LAB 



FIG. 13C 



GET 

NOVEL GAM FOLDED PRECURSOR RNA 




PREDICTED GAM RNA 



FIG. 14A 



-128 



TARGET GENE 
BINDING-SITE 
DETECTOR 
TRAINING & 
VALIDATION 
FUNCTIONALITY 



DICER-CUT SEQUENCES 
FROM HAIRPIN 
STRUCTURES 



CES~\ 

J 



POTENTIAL TARGET 
GENE SEQUENCES 




i 

POTENTIAL NOVEL TARGET 
GENES 
HAVLNG BLNDrNG-SITE/S 



FIG. 14B 



COMPARE SEQUENCES 



FILTER COMPARISON 
RESULTS 



EXPAND BINDING SITE 



CALCULATE FREE-ENERGY & 
SPATIAL STRUCTURE 



ANALYZE 
SPATIAL STRUCTURE 



FIG. 15 




FIG. 16 

^^}RJ>RF^URSOR^)N^^^^^ 



J- 



GR PRECURSOR RNA 



ll If ll GR FOLDED 

' T M 1 M ^ ^PRECURSORI 



MBit 



38 



os os on 

i i i 



-r 



MNDING^GAM^ 

SITE t 

T GAM3 -TARGET \ 



- BINDING . | V GAM2- 
SITE TARGET RNA 

T GAM2-fATGET~ ~ ~j 
J PROTEIN j 



SHE t TARGET RNA 

T gam~"aTgeT ~ ~j 

j PROTEIN ! 



FIG. 17 

^GRI BINDING SITE ^-GR \ PRECURSOR DNA 



I 

GRI PRECURSOR RN A 



| _ Gl if OH ID 



PRECURSOR 
RNA 



JUUt 

llll ff 11 H 

'! H 

| | O I ° GAM3 GR2 PRECURSORDNA-^. 

I ° I I ^~^^^jR2BMMGSTIE^^^ 



GR2 PRECURSOR RNA 



| _ GR2 FOLDED 



JUL! 



PRECURSOR 
RNA 



I O I O I O 

III 



FIG. 18 

r U6 

FIND GAM OLIGONUCLEOTIDES 



;r ^147 

FIND GR POLYNUCLEOTIDES 



v ^148 

FIND HIERARCHY OF 
GR POLYNUCLEOTIDES 



v ^149 

DEDUCE FUNCTION OF 
'HIGH' GR POLYNUCLEOTIDES AND 
GAM OLIGONUCLEOTIDES 



FIG. 19 



GAM 
DETECTION 



ANTI-GAM 
THERAPY 



INDUCED 
CELLULAR 
DIFFERENTIATION 



MODIFIED GAM 
THERAPY 



GAM 
REPLACEMENT 
THERAPY 



FIG. 20A 

GAM PRECURSOR DNA 



FIG. 20B 



GAM PRECURSOR 



DICER COMPLEX 



GAM FOLDED 
PRECURSOR RNA 




| ^GAMRNA 



-GAM RNA GAMTARGET 

H^DINGi^ 

SHE >|< 

_t , 

! GAM TARGET I 
! PROTEIN J 



•GAM PRECURSOR DNA 



GAM PRECURSOR 



DICER COMPLEX 



GAM FOLDED 
PRECURSOR RNA 



GAM RNA 



v — ANTI-GAM RNA 

GAM 
^TARGET 
^ [ RNA 

Rinding 

SITE 




GAM TARGET 
PROTEIN 



o S 1 8 1 I 1 I I I I I I 1 I 



I 

I! 
?! 



s 



§5 

S 2 



I 8 8 s 1 I I S 





lOverall 


D 


O 


00 


> 




GAM 
Detection 
Accuracy 

Group 




o 


O 


N> 
-si 


GO 


00 




Number of 
published 
hairpins 




^5 


o 

0 s 


00 

0 s 




~vj 

O 




Precision 
on hairpin 

m 1 vti i rp 




O 
00 




~vj 


Ol 

CD 


o 


Sent 


Lab valit 




Ol 

K> 






GO 


GO 
-si 


Positive 


dationof L 
GAMs 




o x 


to 




to 


-o; 

o x 


% 
success 


[uman 




| 42416 [ 


I 7876 [ 


I 11765 


I 19950 | 


I 2821 \ 




Hairpins 
in RNA 
databases 




! 1891 | 




CD 


GO 

o> 

-si 


\ 1503|| 




Hairpins of 
the present 
invention 



KNOWN 



FIG. 22A 

1 2 3 ^ 4 5 , e 



Library 

PREDICTED GAM 9 10 11 12 13 14 15 16 17 1 ft 

OLIGONUCLEOTIDES: " ™~ ™" *~ ~^ ~~ ' 

Library 

19 20 21 22 23 24 25 26 27 28 



Hil»l4 H J »l d 



1 



Library 



I EOGOJEUL 



29 30 31 32 33 34 35 36 37 38 



39 40 41 42 43 44 45 46 47 48 

ajJEaa .lira 



Library - + - + - + - + 

49 50 51 



Library 



FIG. 22B 



Ml lltlDCD 

IMUIvlDCK 


NAME 


obUUblMOb (0 IU J) 


OtUUtlMbtU 


^ 


hsa-miR-21 


1 AbO 1 1 A 1 OAbAO 1 on loll on 







hsa-miR-27b 


TTCACAGTGGCTAAGTTCTGCA 




3 


hsa-miR-1 86 


AAAGAATTCTCCTTTTGGGCTT 


J 


^ 


hsa miR 93 


AAGTGCTGTTCGTGCAGGTAGT 




5 


hsa-miR-26a 


1 OAAb 1 AA 1 l/LAooA 1 Auul/ 1 b 


^ 




hca miR 1 Q1 

nsa-mir\- 1 y i 


AAObbAA 1 OOOAAAAbOAbO 1 O 




_ 


hsa-miR-31 


GGCAAGATGCTGGCATAGCTGT 


+ 





hsa-miR-92 


TATTGCACTTGTCCCGGCCTGT 


+ 





o Aiwn/i -i q a 
bAIVIo4 1 o-A 


ATPArATTPPrArrrATTArrA 
A 1 OAUA 1 1 uUUAuuuA 1 1 AUUA 


i 


?! 


bAIVI44zo-A 


oAAu 1 1 1 oAAuOO Ibl Ibl 1 OA 




1 1 


GAM281-A 


CACTGCACTCCAGCCTGGGCAA 




— J! — 


GAM7553-A 


TAGGTAGTTTCCTGTTGTTGGG 




— i~ — 


bAIVIOOOO-A 


1 UALAo 1 bAAUUbu 1 ll> 







bAIVIzoUo-A 


T A ArrTrrATPTAPTPPArTTA 
1 AAbb 1 bOA 1 O 1 Au 1 o^Au 1 1 A 




— 1| — 


o ai\/m mo A 
bAIVI 1 Uoz-A 


PTAPAPTPAAPPTr^TTrAPrA 
O 1 AbAU 1 uAAbU 1 OO 1 1 uAuoA 




16 


GAM3431-A 


TAATACTGCCGGGTAATGATGG 






GAM7933-A 


TAG C AG CACATAATGGTTTGAA 




T7i 

18 


bAIVIozyo-A 


AAAu Ibbl OA 1 Au 1 utAuu 1 Au 





19 


OAI\/l7rtQn A 

bAM / UBU-A 


TTTpp AOAOOOOOOA ATTPTTP 

1 1 1 OOAOAbObbULAA 1 IOI IU 





20 


OAI\/IQO£ A 

(jAMoyo-A 


APPTPOPAPTT^A APAAPATTT 

AbO 1 bOOAb 1 1 bAAbAAOAl 1 1 






OAi\/i'577rt a 


A APTTA ArAr^TPPrAPr^^TP 

AAb 1 1 AAbAbO 1 LLLAbbUL 1 b 




22 


GAM337162-A 


ACTGCACTCCAGCCTGGGCAAC 






23 


bAIVloO/ o-A 


b 1 b 1 1 bbAbbAAb 1 Ub 1 O 1 1 bA 




±1 


bAMzUoo-A 


T^A ArrTrATTmnTA A 00X0 

1 OAAbO 1 OA 1 1 OO 1 O 1 AAOO 1 O 




1| 


bAM/ / /b-A 


O ATTOO AO TOO AOOOTOOOO A A 

OA 1 1 bOAO 1 OOAbOO 1 bbbUAA 






GAM8145-A 


ACATGATCTCCTCACTCTAGGA 





07 


bAIVI ZO-A 


A ATTOOTTO AAOOOAOOAAOTO 
AA 1 1 bU 1 1 bAAbbUAbbAAb 1 b 




ls> 


bAIVI / ooz-A 


TOTTTA AOTAOOTT ATTTATOT 
loll 1 AAb 1 Abb 1 IAI 1 IAIOI 





29 


GAM337624-A 


TCTAAGAGAAAGGAAGTTCAGA 





30^ 


O AIV/M yl 7Q A 

bAMI 4/y-A 


O A AOOO AOTAOOTTOTATAOTT 

bAAbbbAb 1 Abb 1 1 b 1 A 1 Ab 1 1 


i 


21 


GAM2270-A 


ATCACATTGCCAGTGATTACCC 


+ 




OAI\/l7£CH A 

bAivi / oy 1 -A 


TTOOAOTA ATTO AOTATAOOTT 

1 1 ubAu 1 AA 1 1 bAb 1 A 1 Abu 1 1 




_ 

22. 


OAI\/IQOQ£ A 
bAIVIoZoO-A 


AOTAOAOAOTOOOAAOATAOTO 
Ab 1 AbAbAb 1 bbbAAbA 1 Ab 1 O 







OAI\/IC77'3 A 

bAIVIo / / o-A 


OTAOOOTO TTTOTOOTO AOOOO 
O 1 Abbb Ibl 1 Ibl OO 1 OAOOOO 




35 


GAM33681 8-A 


TGAGGTGGGATCCCGAGGCC 






_ 


bAIVIOo04o/-A 


TOOOTAOOTAAOOOAAO 
1 bbO 1 Abb 1 AAbbbAAb 





£1 


bAIVIoo/OZU-A 


A A TO A TO ATT ATTTTO A AOTTT A 
AA 1 OA 1 OA 1 1 A 1 1 1 1 bAAb 1 1 1 A 




38 


GAM336809-A 


T AAG G C ATTTTT ATG GT 







GAM5346-A 


GCTGTTGTTAAGGGCACTTGGG 





40 


GAM8554-A 


TTCATGGGAGCAGGTGGTACAG 




ii 

lk 


bAIVIZ / U I -A 


AOTOO AOTOOAOTOTOOOTO AO 
AO 1 bOAO 1 OOAb 1 O 1 bbb 1 bAb 




4^ 


OAI\/l7Q£7 A 

OAIVI/ yo/-A 


TO AOTOO A AOOTOTOOOTOOOO 
1 OAO 1 bOAAOO 1 O 1 bOO 1 OOOb 






bAMoyi -A 


O AO A TO AO ATOO ATOOOTO AOO 

OAbA 1 OAOA 1 OOA 1 OOb 1 OAOO 




44 


GAM6633-A 


GCACTCAAGCCTGGGTTACAGA 






GAM1 9 


a n. a n. a n. t n. n. n a n. n. t p t n. t t n n t 




46 


GAM8358-A 


GATGAGGCAGCACTTGGG 




47 


GAM3229-A 


TGAGGTGGGAGAATTGCTTGAA 




48 


GAM7052-A 


C ATG T AATCCC AG CT ACTC AG G 




49 


GAM3027-A(mmu-MIR-29c) 


TAGCACCATTTGAAATCGGTTA 


+ 


50 


GAM21 (mmu -MIR-130b) 


C AG T G C AATG ATG AAAG G G CAT 


+ 


51 


GAM oligonucleotide(mmu-MIR 
30e) 


TGTAAACATCCTTGACTGGAAG 


+ 



FIG. 24 A 



EST72223 (705 nt.) 



EST72223 sequence: 

CCCTTATTAGAGGATTCTGCTCATGCCAGG GTGAGGTAGTAAGTTGTATTG 

TTGTGGGGTAGGGATATTAGGCCCCAATTAGAAGATAACTATACAACT MIR98 

TACTACTTTCCCTGGTGTGTGGCATATTCACACTTAGTCTTAGCAGTGTTGCC 

TCCATCAGACAAAGTTGTAGATGTTCCTTGGATAATTTGGACTGGAAGAAAAGA 

GACATGGAAGGGGACAGATGGTGTTTAGGGTGAGGCAGATGTCATTATAAAGT 

GACTTGTCTTTCATTAATTGGAGCATATAATTATTTTACCTTTGGGCATGAACTC 

ATTTTGCTATTCTTCAACTGTGTAATGATTGCATTTTATTAGTAATAGAACAGGA 

ATGTGTGCAAGGGAATGGAAAGCATACTTTAAGAATTTTGGGCCAGGCGCGGT 

GGTTCATGCCTGTAATCCCAGCATTTTTGGGAGGCCGAGGCGGGTGGATCAC 

CTGAGGTCAGGAGTTCGAGACCAACCTGGCCAACACGGCGAAACCCCGCCTC 

TACTCAAATACAAAAATTAGCCAGGCTTGGTGACACTCGCCTGTGGTCCCAGC 

TACTCAGGAGGCTGAGGCAGGAG AATTGCTTGAACCCAGGAAGTGGA G GAM2f 

GCTTCAGTGAGCTGAGAACACGCCACTGCACTCCAGTCCTGGGCAAC 

AGAGCAAGACTCTGTCTCAGGAAAAAAAAAG 
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